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ABSTRACT 

We present a mathematical method to statistically decouple the effects of unknown inclination angles 
on the mass distribution of exoplanets that have been discovered using radial-velocity techniques. The 
method is based on the distribution of the product of two random variables. Thus, if one assumes a 
true mass distribution, the method makes it possible to recover the observed distribution. We compare 
our prediction with available radial-velocity data. Assuming the true mass function is described by a 
power-law, the minimum mass function that we recover proves a good fit to the observed distribution 
at both mass ends. In particular, it provides an alternative explanation for the observed low-mass 
decline, usually explained as sample incompleteness. In addition, the peak observed near the the low- 
mass end arises naturally in the predicted distribution as a consequence of imposing a low-mass cutoff 
in the true-distribution. If the low-mass bins below 0.02Mj are complete, then the mass distribution 
in this regime is heavily affected by the small fraction of lowly inclined interlopers that are actually 
more massive companions. Finally, we also present evidence that the exoplanet mass distribution 
p 1 changes form towards low-mass, implying that a single power law may not adequately describe the 

sample population. 

Subject headings: (stars:) planetary systems, techniques: radial-velocities 

,H | Samples of exoplanetary systems are increasing rapidly thanks to new ground and space-based dedicated surveys, 
. thus enabling investigation of their statistical properties. One of these properties is the planetary mass distribution, 
a key aspect needed to understand the origin of exoplanets and its relation to the initial m ass function. Currently, 
radial- velocity (RV) detections (e.g. iMavor et al. 1983rlButler et al. 19961 : Uones et al. 20101 ) have provided the largest 
. sample of unconstrained systems. However, the RV technique does not provide masses directly because the line-of- 
■ sight inclinat ion angles, i, cann ot be measured unless complementary observations are carried out, for instance transit 
[ — \ photometry (|Henrv et al. 2000T) or astrometry (|Benedict et al. 2 006). Thus, all masses measured with this technique 
[ — . are indeed 'minimum' planet masses, M Q b s = Mt sinz, where Mr, the 'true' planet mass, is not known a priori. 

Understanding the true mass distribution rather than the minimum mass distribution will allow modelers to com- 
pare their mass distr i butions against a function that is free from one of the largest sources of uncertainty (see 
iMordasini et al. 20091 llda fe Lin 20051 ). Also, the sini degeneracy that plagues RV signals means we can never be 
^ ■ fully sure that any individual signature is planetary in nature from the Doppler data alone. This has consequences 
for a number of aspects of planetary, brown dwarf, and low-mass star studies that deal with inferences drawn from a 
RV dominated mass distribution. A prime example of this would be the proper location in mass of the planet-brown 
dwarf boundary (see Sahl mann et al. 20111) . which allows one to clarify the status of an object as either a planet or 
Jen 



a brown dwarf (e.g. I Jenkins et al. 2009al ). and will help us to better understand the formation mechanisms of both 
classes of objects. 

It is thought that th e sinz correction is of order unity and would preserve the power- law sh ape of the observed 
mass distribution (e.g., Uorissen et al. 20011 ITabachni k fc Tremaine 20021 iHubbard et al. 20071 Morton & Johnson 
2011). However, no proof has been provided to substantiate this. Methods to recover the Mr distribution from the 
observed minimum-mass data have been proposed based on: (1) nume rically solving an Abel-type integral equation 
that relates observed and true mass distributions (jJorissen et al. 2001f): (2) analytically fi nding the distribution that 



maximizes the likelihood of having a given set of minimum masses ( Zuc ker fe Mazeh 20011); (3) com paring cumulative 
distributions of projected and de-projected data with non-para metric statistical tools dBrown 201 If ): and (4) using the 
physics of multi-planet systems to resolve the sini correction (jBatvgin fc Laughlin 20 111 ). With the exception of (4), 
which is a theoretical prediction, these methods are non-parametric, i.e., they do not assume an a priori model of the 
data. However, they also suffer from some drawbacks like the need to smooth the data (1 and 3), or the complexity 
of introducing observational limits (2). 

In this paper we present an alternative method to statistically decouple the sin i dependence in observed exoplanet 
mass functions. The method is based on the expected distribution of the product of two continuous and independent 
random variables. Its parametric nature requires an assumption on the shape of the underlying (Mr) distribution; 
but, on the other hand, it offers the possibility of introducing observational constraints in a straightforward fashion. 
The mathematical problem, applied to planet mass distributions, is stated in § [2] and its solution presented in § [3] 
In § |4] the method is implemented on two example distributions and in § [5] we make a comparison with observational 
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Fig. 1. — Top panel: PDF of X describing true masses, with a = 0, m m i n = 0.02 and m ma x = 22 Mj. Bottom panel: PDF of Y describing 
sini corrections, with i values drawn from a uniform distribution in the range [5.7,90.0] degrees. 

data. In § [5] we look at the significance of the shape of the true mass distribution and how this may affect current 
models of planet formation and evolution. Finally, in §[7] we summarize the results. 



2. THE PROBLEM AND OUR CONCEPT 

The problem can be summarized as follows: given an analytical model of the distribution of true planetary masses, is 
it possible to obtain the distribution of minimum-masses analytically by assuming a random distribution of inclination 
angles? The answer to this question is yes, and relies on computing the probability density function (PDF) of the 
product of two random variables. The problem of finding the PDF of the product of two random variables was first 
solved by Rohatgi (1976) but its implementation relied on knowledge of the joint PDF of the two variables. In this 
paper we apply the approach followed by Glen et al. (2004), which uses the individual PDFs of the two random 
variables. 

To do so, let X, Y, and Z be random variables, such that 



X = M T (1) 

y = sini (2) 

Z = XY . (3) 

The above equations state that X takes the value of any exoplanetary mass, Mr, Y takes the value of any correction 
by inclination (viewing) angle i, sini, and Z takes the value of any possible product Afxsini. Let also fx, fy, 
and fz be their respective probability density functions (PDF). We wish to obtain fz, a prediction for the observed 
distribution of 'minimum' masses, given fx (an assumption for the true mass distribution) and fy (which can be 
calculated analytically). 

3. SOLUTION 
3.1. PDF of X 

Let fx represent the distribution of planet true masses. For simplicity, we assume fx is well described by a power- 
law. We define a power-law of index — (1 + a), such that it is normalized over the mass interval [m m i n , m max ], defined 
to be the minimum and maximum true masses. Note that a minimum non-zero mass avoids a diverging integral, which 
is a condition for fx to be normalized. Thus, fx becomes 

p { \ j f A X X~ a ~ X dx, < TOmin < X < TOmax , 

fx\x) ax — < n . (4) 

J w 10, otherwise, v ; 
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where x represents true masses, Ax = a/(m ? — m m ^,) is a normalization constant and a > — 1. If a — 0, then 
Ax = (In (m max /m m i n )) _1 . It is important to note that X can be treated as a continuous variable and that the 
present analysis does not require that masses be normalized (i.e., x can be in any units). An example of this PDF, 
representing the true mass distribution, is shown in the top panel of Fig. [TJ 



3.2. PDF of Y 

Consider randomly distributed inclination angles, i. The observed inclination angles can be shown to be distributed 
like sini, over the interval [0, This is straightforward to see in spherical coordinates (e.g., Ho & Turner 2011) and 
implies that higher inclination angles (edge-on) are more probable than lower ones (pole-on). 

To get fy for a sin i-distribution of angles, let y = sini. Then i = arcsiny, with i £ [0,5]. The corresponding 
cumulative distribution (CDF) is given by 1— cosi. Thus, the differential of this CDF, d(l — cos [i]) = d(l — cos [arcsiny]) 
gives f Y : 

Mv) d y = ^= dy, < y < 1. (5) 

V 1 -y 

The above equation considers all possible angles. However, it may be useful to define a minimum inclination angle, 
«min, to account for possible selection effects when comparing with real data. In this case fy becomes 

y 

fr(y) dy = A Y = dy, sini min < y < 1. (6) 

V 1 - V 



where Ay = 1/y 1 — sin (i m i n ) 2 is a normalization constant. An example of this PDF, representing the sin i distri- 
bution, is shown in the bottom panel of Fig. [T] 

3.3. PDF of Z 

Having defined fx and fy, the PDF of random variables X and Y, we can now calculate fz, the PDF of the product 
XY. Following Glen et al. (2004) 's solution for the PDF of the product of two continuous and independent random 
variables, and also considering the above boundaries, fz(z) can be expressed as 

z j sin i m i n 

/ fY(%)fx(u)~du, m min sini min < z < m min , 

m m i n 
z j sin i m i n 

fz( z ) = { J fy(z)f x (u)±du, m min < z < m max smi min , ( 7 ) 

Z 

Tina ax 

/ fY{^)fx{u)^du, TO max sini min < z < m max , 

z 

provided that m m ; n < m max sini m i n . This condition is the equivalent of setting a lower limit on i, such that pole-on 
orbits, producing very large corrections, are excluded from the observed sample. It also determines three 'validity 
regions'. 

Replacing Eq. [7] with Equations S] and |H1 after some algebra and getting rid of the integration limits for a moment, 
Eq. [7| reads: 



fz(z) = A x Ayz / du . (8) 

J vi — z 

For any value of a, the improper integral in Eq. [S]has a primitive in terms of 2-P1, the first hypergeometric function 
(Abramowitz & Stegun 1964). From Eq. [71 note also that one important property of fz is that it vanishes at the 
boundaries z = m m i n sini m i n and z = TO max , meaning that the observed mass distribution must have a peak. This has 
important consequences when interpreting distributions of real data (see § |6]) . 

In conclusion, the problem stated in § [U is formally solved for the distributions described in § 13.11 and § 13.21 Indeed, 
Eq. [JJis valid for any true mass distribution (i.e., not only for a power-law) but in general the evaluation of fz will 
require numerical integration. 



4. EXAMPLES 

We now evaluate numerically two examples of true mass distributions: a power-law and a log-normal distribution. 

For the power-law introduced in ij3.1l and used to derive Eq. [51 the case a = is easy to evaluate (and will allow us 
to perform a quick comparison with RV data in the next Section) . Replacing with a = and integrating Eq. [8j the 
predicted minimum-mass distribution becomes: 
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Fig. 2. — Top panel: power-law distribution of true masses (fx', red curve) with m m i n = 0.02 Mj and m max 
predictions of minimum-mass distributions (fz', blue curves) for various minimum inclination angles, i m i n = 

1 „„„ Qnx-^y 



Bottom panel: same as above but for a log-normal distribution of true masses, fx{ x ) 



cxp - 



= 22 Mj, and corresponding 
0.57, 5.7, and 30.0 degrees. 

with (j, = and cr = 0.5. 



fz{z)dz = x < 



\J (z j Sill2 m i n ) 2 — Z 2 \/ m min _ 22 

js/silli m i n *Tlmin 

(2:/ sin i m j n ) 2 -2 2 
z I sin i m i n 



dz, 



dz. 



dz, m min sin i min < z < m n 



m max smi min < z < m r 



(9) 



The function /x represents the predicted distribution of minimum-masses if the true mass distribution is proportional 
to Mf 1 . 

The top panel of Fig. [2] shows fz for various minimum inclination angles (blue curves; « m i n = 0.57, 5.7, and 30.0 
degrees). For reference, the underlying distribution of true masses, fx, is also shown (red curve), where we have set 
?7i m i n = 0.02 Mj and m max = 22 Mj. The effect of the sini correction is evident at both mass ends of the predicted 
distribution, as objects in any mass bin 'migrate' to lower-mass bins in the minimum mass distribution. 

However, the shape of the low-mass tail (z < m m i n ) is affected mostly by the inclusion of large sini corrections, i.e., 
lowly-inclined systems. This is readily seen from the fact that fz converges to fx in the limit sin« m ; n « 1. Thus, in 
general, the observed distribution of a true mass power-law distribution (with boundaries) should show a decrease at 
the low-mass end. 

The same effects are observed if a log-normal distribution is used instead of a power-law. The bottom panel of Fig. [2] 
shows such a distribution (same color codes for the true and predicted mass distributions and same i m i n values) , where 
Eq. [7] has been integrated numerically using 



fx(x) 



1 



XO~ V 2"7T 



exp - 



(In a; — /i) z 



2a 2 



(10) 



with fi = and a = 0.5. 

We conclude by emphasizing that setting a particular value of i r 



simulates observational selection effects, meaning 



that the model mimics observational samples which have missed objects at angles below that limit. Fig. [2] shows that 
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Fig. 3. — Observed distribution of minimum masses for a sample of 643 RV discovered exoplanets (data taken from http : //exoplanet . eu 
as of October 17 2011). Bin sizes were arbitrarily chosen to have a roughly equal number of systems. Overlaid is the model of minimum 
masses given by Eq . |91 and obtained from an assumed true mass distribution with the following parameters: /(Mx) <x M^ 1 , m m - lu = 0.02 
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22 Mj. The predicted minimum- mass distribution was set to have sini n 



0.1. The dashed lines mark the validity regions 



including those lowly-inclined systems, despite them being less probable, induces dramatic changes in the shape of the 
predicted distribution at the low-mass end. 
We now proceed to compare our models with RV data. 

5. COMPARISON WITH OBSERVATIONAL DATA 

Fig. [3] displays the observed distribution of minimum masses (Schneider et al. 2011; RV data on 643 planets as of 
October 17 taken from http://exoplanet.eul. Bin sizes were arbitrarily chosen to have a roughly equal number of 
systems. Overlaid is the model prediction of minimum masses given by Eq. [!JJ i.e., the prediction that results from 
assuming a true mass distribution of the form oc M^ 1 . We have chosen the following physical parameters for the 
true mass distribution: m mm = 0.02 Mj and m mm = 22 Mj, where the lower limit is chosen from observation and 
the upper limit marks the planetary-brown dwarf mass boundary (Sahlmann et al. 2011). To simulate observational 
selection effects, sini m i n = 0.1 was chosen. We emphasize that these parameters are not the result of a fit, but were 
adjusted manually to attempt a best match to the data. 

It can be seen that the power-law part of the predicted curve (Fig. [3]) only poorly fits the data. However, interestingly 
enough, both of its extremes do seem to better reproduce the data. 

At the low-mass limit (z < 0.02 Mj) the model describes mass-bins which are not 'occupied' in the true mass 
distribution. As already mentioned, the shape of this tail is affected mostly by the inclusion of large sini corrections, 
i.e., lowly- inclined systems, which in turn produce a decrease at the low-mass end of the obser ved distribution. Suc h 
a decrease mimics observational effects on the data like those ones induced by incompleteness (|Udrv fc Santos 2007f) . 
as we discuss further in next Section. 

At the high-mass end, there is no apparent reason for a systematic lack of observed systems. Here the data shows a 
decline which is also in good agreement with our model's prediction. However, the small number of systems precludes 
any firm conclusion. 

An interesting feature that appears in the data between the high and low-mass ends of the distribution is a deficit 
that is not well described by the power law model prediction. Around 0.1 Mj the data drop and then quickly turnover 
to rise again, creating a mass distribution paucity in the data. Given we expect this mass regime to be fairly well 
sampled by current RV surveys, this feature may indicate that more than a single power law is needed to describe the 
full ensemble of planetary masses. 
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We conclude by emphasizing that the above approach provides a direct comparison with observational data. As- 
suming the true mass distribution is at least fairly well described by a power-law, we have shown that our solution for 
the minimum-mass distribution provides a good fit to the observations at both mass ends. 

6. DISCUSSION 

We have shown that if the true mass distribution is described by a power-law with boundaries, there must be a 
peak in the observed mass distribution of exoplanets, and therefore this peak has implications for planet formation 
and evolution models. A peak in the planetary mass distribution tells us that most of the mass of the proto-planetary 
disk that goes into forming planets, gets locked up in the formation of the larger gas and ice giants. This agrees with 
the mass distribution of the planets in the solar system. 

The widely accepted p lanetary formation theory is by core accretion and subsequent planet migration (|Pollack 1984t 
iLin fe Papaloizou 19861 ) and this model can broadly explain the currently observed population of exoplanets. The peak 
in the planetary mass distribution needs to be taken into account when comparing the outcomes from core accretion 
formation models against the observed mass distribution of exoplanets, unless the true mass distribution changes form 
towards the lowest masses. 

One interesting question that arises is, what does the position of the peak in the mass distribution tell us and what 
does it mean? As we have seen in Fig. [3J our true mass distribution model can provide a good fit to the observed mass 
distribution of the current population of exoplanets, particularly the peak and subsequent decline. 

The low-mass peak we find that best describes the observed data is located around 0.02Mj, or ~6.5M e . When we 
look at systems that have high inclinations, like an observed mass distribution drawn from transiting planets only, 
we find that for a complete sample, the observed distribution follows the true distribution with no low-mass peak. 
Therefore, if we assume that the bins are complete below 0.02Mj, then this is the regime where we begin to observe 
the effects of systems with low inclination angles, and hence large mass corrections. 

Here we are assuming we have sampled all angles above a certain limit, i m i n , but our model considers the lower 
likelihood of observing systems with low inclinations; therefore this tells us something important that even a small 
fraction of systems with low inclinations can produce large changes in the observed mass distribution in the low-mass 
regime. This result is in line with the implications of the analysis by Ho & Turner (2011). These authors demonstrate, 
using Bayesian analysis, that the posterior distribution of angles is determined by the particular true mass distribution 
and so the latter cannot be simply obtained from the observed one. Unlike Ho & Turner, we deal here with the prior 
distribution of angles and study its effects on the assumed shape of the true mass distribution; however, both approaches 
lead to the conclusion that the low inclination systems modify the low-mass end of the observed distribution. 

Also, since most of the observed distribution above the 0.02Mj boundary broadly follows the true distribution then 
small to medium values of sini do not affect the overall mass distribution in a large manner. We make it clear that most 
of the low-mass systems in these bins are genuine rocky planets, but that the small numbers of low-inclination/high- 
mass interlopers cause a dramatic change to the observed mass distribution, again, assuming that the low-mass bins 
are complete. 

Finally, we do caution that the true mass distribution we are discussing here applies to planets with small semimajor 
axes (<4 AU's or so). The distribution of mass at ever increasing distances from the central star may change the 
shape of the true mass distribution, but fur ther analysis on th is issue is likely to require many more detections like 
the directly imaged planets around HR8799 (jMarois et al. 20081 ) or planets discovered by microlensing techniques (e.g. 
iMuraki et al. 20111 ). 

7. SUMMARY AND OUTLOOK 

We have applied the formal solution for the PDF of the product of two independent random variables to the 
observational problem of decoupling the sinz factor from an observed sample of exoplanet masses. Our approach 
requires that the true mass distribution is modeled by a continuous function that represents the PDF (within given 
physical or observational limits). 

We have shown that if the true mass function is modeled as a power-law, comparison with observed data of 643 
RV planets shows a good match with our method's prediction, specifically at both mass ends. In particular, the 
prediction agrees well with the decline observed toward the low-mass end, thus providing an alternative explanation 
to the turnover being the result of observational biases. 

If the low-mass bins below 0.02Mj are assumed to be complete, then we show that the presence of a small number of 
systems with low inclinations, and hence much larger true masses, heavily affects the distribution in this regime. Such 
effects are not seen above this mass region where the sini values are more modest and hence corrections are smaller, 
meaning the true distribution is being matched more closely. 

We also again note a mass paucity around 0.1 Mj in the observed mass distribution, as the single power law model 
does not describe this region very well at all. In fact, it may be prudent to examine the mass distribution using more 
than one function, like a double power law for instance. This may indicate that the mass distribution changes form 
below around 0.1 Mj, a feature we plan to study more in the future. 

In summary, we have provided a practical and intuitive method to decouple the sini effect that is inherent to RV 
samples. The method offers the possibility of introducing observational constraints in a straightforward fashion, in 
order to compare predictions with current observations. 

Current RV surveys are plagued with biases and incompleteness that affect conclusions drawn from analyzing any 
Doppler data set. For instance, the RV detection method is heavily biased towards the detection of more massive 
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companions on short period orbits, since they induce a larger reflex motion on the host star in comparison with less 
massive and longer period companions. Therefore, the detection of very low-mass pl anets is only now being fully realized 
and requires large data sets and novel detection /characterization methods (e.g. iVogt et al. 2010t iPepe et al. 20111 : 
lAnglada-Escude et al. 2 012; Jen kins et al. 2012T ). Hence, studying the low-mass end of the mass distribution is one of 
the corner-stones of exoplanet research at the present time and will continue to be so in the near future. 

The authors acknowledge the very helpful discussions with Hugh Jones and Raul Gouet, as well as the important 
feedback given by an anonymous referee. SL has been supported by FONDECYT grant number 1100214. JSJ 
acknowledges funding by FONDECYT through grant 3110004 and partial support from the Gemini-CONICYT Fund 
and from the Comite Mixto ESO-Gobierno de Chile. Wolfram Mathematica online integrator was used. 
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